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Abstract 

Replicating or caching popular content in memories distributed across the network is a technique to reduce peak 
network loads. Conventionally, the performance gain of caching was thought to result from making part of the requested 
data available closer to end users. Recently, it has been shown that by using a carefully designed technique to store 
the contents in the cache and coding across data streams a much more significant gain can be achieved in reducing the 
network load. Inner and outer bounds on the network load v/s cache memory tradeoff were obtained in HI. We give 
an improved outer bound on the network load v/s cache memory tradeoff. We address the question of to what extent 
caching is effective in reducing the server load when the number of files becomes large as compared to the number 
of users. We show that the effectiveness of caching become small when the number of files becomes comparable to 
the square of the number of users. 


I. Introduction 

In recent times, there has been an increase in demand for online video streaming leading to high data traffic. Also, 
it is observed that the demands are variable across time, with periods of high and low traffic demand. The load on fhe 
server is high during peak hours when a majority of users access video and relatively low at other times. Thus, there 
exists the possibility of storing content at the end users during the off peak hours such that the load on the server 
is reduced during peak hours. This method is called caching. There are two main phases involved in this process, 
placement phase and delivery phase. In the placement phase, data is stored at the end user when the network is 
relatively uncongested; here the constraint is the cache memory size at the user. Also, at this stage the actual request 
the user might make is not usually known. In the delivery phase, when the actual requests of the users are made, the 
constraint is the rate required to serve all the requested content. 

A straightforward approach is to cache a copy of a fraction of all the files af all fhe users. Then in fhe delivery 
phase, the central server needs to send only the remaining parts of the requested files. This is effective only when 
fhe cache size is comparable to the database size at the server. 

A more sophisticated approach is to allow the central server to satisfy the request of several users with different 
demands with a single multicast stream as was shown in iH using the idea of network coding fU. Streams are 
generated by coding across the different files requesfed. This reduces fhe rafe as compared fo a convenfional caching 
scheme. The requested files are decoded from the data stream using the contents stored in the local cache memory. 
The gain from this approach is not only proportional to the cache size but also increases with the increasing number 
of users. Another approach suggested in |[T| is to store contents that are coded across files fo reduce fhe rafe. 

In im, inner and oufer bounds on fhe optimal Iradeoff befween cache size M af each user and fhe dafa rafe R 
required fo service any set of single file requests from all the users were obtained. Considering a popularity distribution 
on the files, inner and outer bounds on fhe Iradeoff between cache size and expected load of the shared link was 
obtained in |[3l. An online version of this problem was considered in lUl. In Q, a scheme was proposed where 
the placement phase is distributed and not centrally controlled by the central server. In 161, a hierarchical system is 
considered, where caching happens at two or more levels. 

In this paper, we are interested in the case when the database size is large compared to the number of users. For a 
fixed cache size, when fhe number of files is considerably large compared to the number of users, no significant gain 
in the rate can be achieved by any scheme compared to having no cache. Specifically, we are inferesfed in finding 
fhe minimum number of files beyond which fhe benefifs of caching disappear in fhe selling of iH. To Ihis end, we 
firsl prove a general outer bound on the optimal (M, R) tradeoff which generalizes an example in H]. We show that 
the gains from caching are small when the number of files is comparable to the square of the number of users. We 
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then define the pre-constant to the ©(iC^) term (where K denotes the number of users). Using the improved outer 
hound we obtain a better upper bound to this pre-constant. 

The rest of the paper is organized as follows. In Section II we recapitulate the system model proposed in |[T1, 
and in Section III we summarize the different caching strategies proposed there. We derive a new outer bound on 
the tradeoff of cache size and rate in Section IV by generalizing an example in ||TJ. In Section V, we calculate the 
minimum number of files beyond which benefits of caching become small. We finish with a short discussion in 
Section VI. 


II. System Model 

Consider a system (see Fig. 1) with K users connected to the central server through a shared, error free link. 
The server has access to the database containing N files VFi,..., ITat, of F bits each, all independent and uniformly 
distributed. Each user has access to a cache of size MF bits for some real number M G [0, A^]. In the placement 



Fig. 1. Caching system consisting of N files at the server, K users each having a cache of size M files as in [1]. 

phase, the user fills the content of its cache by accessing the database. In the delivery phase, user k requests one of 
the files from the database. The server knows all the requests and transmits a signal of size at most 

RF bits, where we call R the rate and {di,...,dK) the file request vector. Using the content Zk of its cache and 
the signal received each user k must decode its requested file Wd^. For the rest of the paper we will be 

expressing R and M as well as entropies and mutual informations in units of F bits. 

Definition 1. The memory-rate pair (M, R) is achievable if for every e > 0 and every large enough file size F there 
exists an (M, R) caching scheme such that the probability of error in decoding the required file is less than e for 
each request vector. We define the optimal memory-rate tradeoff as 

R*{M) = inf{i? : {M,R) is achievable}. 

III. Caching Strategies 

We summarize the three strategies given in iH. Here, coding refers to taking linear combinations of the requested 
files. 

A. Uncoded Caching 

There is no coding involved in this strategy. Each user caches ^ fraction of each file in the placement phase and 
in the delivery phase the 1 — ^ fraction of the file that is not available to the user is transmitted by the server. Since 
there are N files, and the size of each file is F bits, the cache size of each user is MF bits. In the worst case, when 
no two users request the same file, for each of the K users, the server needs to transmit the remaining 1 — ^ part 
of each file. This gives an achievable rate Ru{M) which is, 

F[/(M) = iT 1^1 - .minjl, . (1) 
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There are two factors, K which is the rate without caching and 1 — which is the gain because of the availability 
of caches at the end user referred to as local caching gain. When the number of users is more than the number of 
files then an additional gain of ^ is obtained. 

B. Coded Caching 

In this strategy, as mentioned before, the aim is to multicast (combine various files meant for different users) in 
the delivery phase. In the placement phase, each file is divided into (j^) equal-sized parts, and each user caches 

bits of each file such that every set of users have one part of each file in common. For the delivery phase, 
consider any set of + 1 users. Each user in the set will require a part of the requested file that is present at the 
remaining users in the set. The central server sends a linear combination of all the + 1 requested parts. 
Similar linear combinations are sent by considering all possible sets of + 1 users. This gives an achievable rate 
Rc{M) of HI, 

( 2 , 

In addition to the local caching gain as explained in section IIII-AI coded caching achieves an additional gain of 
which is the global caching gain. 


C. Coded Content Placement 

The achievable rate of section IIII-RI can be further improved by coded content placement. For M = ^, coded 
content placement strategy has a lower rate compared to coded caching strategy which improves the rate in the region 
M = ( 0 , 1 ). We illustrate this with an example. Consider the case of = iT = 3 and M = 1 / 3 . In this strategy, we 
split the three files A, B, C into three sub files i.e., A = (Ai, A2, A.3), B = {Bi, B2, B3) and C = {Ci, C2, C3). The 
caches are stored with Zi = Ai ® Bi ® Ci, Z2 = A2 ® B2 ® C2 and Z3 = A3 ® B3 ® C3. Consider that user one 
requests file A, user two requests file B and user three request file C. The server satisfies the requests by transmitting 
(i^i, Cl, ^42, C2, A3, B3) at rate R = 2 which does better than the achievable rate Rc{M) given by (O as shown in 
Fig. 2 . 


IV. Lower BOUND ON 22*(M) 

In this section, we first summarize the cut-set bound of m and then give an improved bound. 


A. Cut-Set Bound 

Let s € {1, ..., min{V, iF}}. Consider 2f(i 2,...,s), which is transmitted during the delivery phase, on the shared link 
when the first s users request files 1, 2,... , s, respectively. Then, 2f(i 2,...,s) along with the caches Zi, ... , Zg oi the first 
s users must determine the files Wi, ...,114. In a similar manner consider 2^), •••, 

Now 2f(^2,...,s),-",2f((dJ-i)^-ti,...,(LN/sJs)) and Zi,....,Zg must determine IFi,...., FFdJs- Since [N/s] trans¬ 
missions of size R and s caches of size M determines s[V/sJ files we have. 


[iV/sj 22 *(M) + sM > s[N/s\. 


Solving for R*{M) and optimizing over all s, we obtain 


R*(M) > max 

sG{ 1,.,min{A^,X}} 



(3) 
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B. An Improved Bound - An Example 

In this section, we give an example to illustrate how the lower hound on R*{M) can he tightened compared to 
the cut-set hound (l 3 ]l hy generalizing the approach used in H] Appendix]. 

Example 1. Consider the case of = 9 files and K = A users. We consider Ari245,X3167, Ar89i2 and X7431, the 
signals transmitted hy the server for the request vectors ( 1 , 2 , 4 , 5 ), ( 3 , 1 , 6 , 7 ), (8, 9 , 1 , 2 ) and ( 7 , 4 , 3 , 1 ), respectively. 
Wi can he decoded hy user 1 using its cache Zi and X1245. Similarly, user 2 can decode file Wi using Z2 and X^iq^. 
In the same way, users 3 and 4 can decode file Wi from their caches along with X^qi2 and Ar743i, respectively. 
Now, notice that W2 and 1^3 can he decoded hy combining X1245, ^3157 and the caches Zi of user 1 and Z2 of 
user 2 . Specifically, user 1 with its cache Zi and X3167 can decode file W3 and user 2 with its cache Z2 and X1245 
can decode file W2. In the same way, files W2 and W3 can also be decoded by combining X8912, ^^7431 and the 
caches Z3 of user 3 and Z4 of user 4 . This combining refers to step (b) in the chain of inequalities below and is key 
to obtaining our lower bound. The remaining files {W4,W5,WQ,WY,Ws,Wg) can be decoded by taking all the 4 
request vectors together and using the corresponding cache of the user that requests that file. The steps given below 
demonstrates this procedure. Recall that R, M, entropies, and mutual informations are all in units of F bits. For any 
achievable memory-rate pair {M,R), (below we suppress the small terms resulting from Fano’s inequality) 


4M + 4i? > 77 (Xi245, ^ 1 ) + ^(^3167, Z 2 ) + H{Xs 912 ,Zs) + H{Xu31,Z4) 

= H{X42A5,Zl\Wi) + I{Wi;X 42A5,Z4) + H{X3ie7,Z2\Wi) + I{Wi-,X3167,Z2) + 
H{Xs912 , Z 3 1 IFi) + /(lEi; X8912, ^ 3 ) + 77(X7431 , Z 4 1 lEi) + /(kFi; X 7431 , Z 4 ) 

(a) 

> H{Xi245,Zi\Wi) + H{Xsie7, Z2\Wi) + 77(X8912, ^3|Wl) + H{X743uZ4\Wi) + 4 


(b) 


> iT(Xi 245 ) ^ 1 ) -^ 3167 ; ■^2|W^l) -|- F[{Xsgi2, Z3, X7431, Z4|kFi) -|- 4 
= 77(Xi245 ,Z4,X3ie7,Z2\Wi,W2,W3) + I{W2,W3; Ai245, , ^3167, ^21) + 

H{Xs912 , Z 3 , X743I ,Z 4 \Wi,W 2 ,W 3 )+I{W 2 ,Ws; Xs912 , ^3, ^7431, ^41 ) + 4 


> H 


Xl 2 A 5 ,Zi, X3167 ,Z 2 , 
-^ 8912 , ^ 3,-^7431 j ^4 



+/(IE2, IF3 ; ^1245, ^1, ^3167, ^2|^Ei) + 


/(IE2, W3; X8912, ^3, ^7431, ^41 ) + 4 


(c) 

> I 


( W 4 ,W 5 ,W 6 ,W 7 ,W 8 ,Wg; 

-^1245 ,Zi, X3167 ,Z2, 
\ -^ 8912 ,-^ 3 ;-^ 7431)-^4 



14 , 


where (a) follows from Fano’s inequality since Wi can be decoded from each of (-A1245, Z4), (A3167, Z2), 
(X8912,.^3) and (X743i,Z4), and (b) holds because 

77(Xi 245, ^ll^El) + 77(^3167, ^2 |H"i) > 77(Xi245, ,773167, ^2|kFl), 

77(X8912,^3|W^i) +77(777431, ^4|VFi) > 77(778912, ^3, 777431 , ^4|W^l). 

Similarly (c) follows from Fano’s inequality because 1 + 2,443 can be decoded from each of (X 1245 , ^ 1 ,773167, .^ 2 ) 
and (778912,-^3,777431,^4). Similarly, (d) holds because (1+4,1+5,416,147,418,4+9) can be decoded from 
(771245,^1,773167,^2,778912,^3,777431,^4). Combining the above results we get, 

M + R*{M) > 3 . 5 . 

This is an improvement over the cut-set bound which gives M + R*{M) > 3 . The coded caching achievable strategy 
gives infM>o M + Rc{M) = 3.75 at M = 2 . 25 . □ 


C. General Lower Bound 

Our main result is the following lower bound on the optimal (M, R) tradeoff. Recall that M, R are in units of F bits. 
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Theorem 1. 

For a > 0 and K >2 users, if (M, R) is achievable, 

(i) then for iV > [i], 

2(n - 7), ^ ( 3 (ra - if - (n - 7) + l) 


aM + R> { 


where, 


n = 




7il 


7 = max 0, 


K 


n — 


(ii) then for iV < [i], 


N 

(y]\f[ -\- R ^ p 1 

I a \ 


For a > 1 and K >2 [aj users, if (M, R) is aehievable, 

(i) then for N > [aj, 

' jV-H((n- 7 F-(n- 7 )+l) ^ ^ ^ 3 ^^ _ ^^2 _ 

2 (n - 7 ) L«J > ^ > L«J (3(n - if - (n - 7 ) + l) 


aM + R> 


where. 


n = 


[crj + \J~\a\ + 12 [q;J {^N — [cij) 


7 = max I 0 


n — 


6 [aJ 
K 


(ii) then for iV < [aJ , 


aM + R>N. 


(4) 


(5) 

( 6 ) 

(7) 


( 8 ) 


(9) 

( 10 ) 


( 11 ) 


A proof is given in the Appendix. The next example also shows that, in general. Theorem 1 is tighter than the cut-set 
bound (I 3 ]l. 

Example 2. Consider the case of = 3 files and K = 3 users. The cut-set lower bound (| 3 ]), the lower bound of ([U) 
for a = 1 , and the achievable tradeoffs of ([ 1 ]) and dH) are shown in Figure 2 . 


V. Critical Database Size for Efeective Caching 

For any caching system, if the number of files grows we expect the reduction in R to be small, for a fixed number 
of users K and cache size M. In general, each user may find only a small fraction of the file requesfed in its cache. 
This results in the server having to send a significant part of the requested file in mosf cases. So the decrease in rate 
R for a fixed M is negligible. Hence, having a large database decreases the benefits of caching. 

To find the minimum database size for a fixed number of users for which caching becomes ineffective, we consider 
the quantity {aM + R*{M)), which arguably measures the cost of operating a caching system, where a > 0 is the 
relative cost of cache memory (per user) versus server bandwidth. Clearly, 

inf {aM + R*{M)) < K, 

M>0 
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Fig. 2. The (M, R) tradeoff for Af = 3 files and fT = 3 users. 

since R*{M) = K for M = 0 , as the central server must serve the whole file when there is no cache. We are 
interested in finding the smallest size of the database, such that infM>o (ctM + R*{M)) = K. 

Definition 2 . For any K users and a > 0 , N{a,K) is the minimum number of files such that 

inf (aM + R*(M)) = K. 

M>0 

The following three lemmas give upper and lower bounds on N {a, K). Lemma 1 uses the cut set bound to derive 
an upper bound on N{a,K). An improved upper bound using Theorem 1 is given in Lemma 2 . A lower bound on 
N{a,K) using the coded caching achievable strategy of fTl is given in Lemma 3 . 

Lemma 1. For K users and a > 0 , 


N{a,K) < 



a 


Using the lower bound we derived in Theorem 1 , we can improve upon this bound. We illustrate this with an example. 


Example 3 . Consider the case when there are AT = 4 users and instead of 9 files considered in Section IIV-BI 
suppose we increase the number of files to N = 11 . Following the same procedure as in Example 1 , we get 

M + R*{M) > 4 . 


Thus upper bound on 


N{a,K) = 11 . 


This is an improvement compared to Af = 16 files given by Lemma 1 . 


□ 


Lemma 2. For K >2 users and a > 0 , 


N{a,K) < 




K' 

~2 


For K >2 [aj users and a > 1 , 


N{a,K) < [aJ 3 


K 


-I 2 


+ 1 


K 


+ 1 




Lemma 3 . For K users and a > 0 , 
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The proofs of the lemmas are given in the Appendix. From the lemmas it is clear that N{a,K) = Thus, it 

is important to characterize the smallest pre-constant to the term which is concretely defined as, 


Pa 


A 


lim 

K^oo 


N{a,K) 


The following theorem directly follows from the lemmas. 

Theorem 2 . For any K users, a > 0 and N{a,K), /?„ is hounded hy 


,«>!• 

Since the minimum number of files N{a,K) such that infM>o (“-^ + ^*(-^)) = iT is of ©(AT^), we can 
conclude that the effectiveness of caching becomes small when the number of files becomes comparable to the 
square of the number of users. 


VI. Discussion 

In this paper, we consider the case when the number of files is large compared to the number of users in a caching 
system. First, we studied inner and outer bounds on the memory-rate tradeoff and present an improved outer bound 
by generalizing the approach used in ||T1. We showed that when the number of files is comparable to the square 
of the number of users, the benefits of caching become negligible. We defined the /?„ to be the pre-constant to the 
term. Using the improved bound, we obtain a better upper bound to this pre-constant. 

We studied the worst-case shared link load (as in HI). We expect similar results to hold for the expected load of 
the shared link under popularity distributions on files with a large number of popular files. 
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Appendix 

Proof of Theorem I 

We will first obtain a lower bound on (M -|- R), for any achievable {M,R), i.e., the case of a = 1 . For this, we 
first consider the case of iT > 2 n, where n is as defined in dS]). Note that 7 of dH) is 0 in this case. Recall Example 
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1 where 4 request vectors were considered. Similarly, we consider the following 2 n request vectors 



• • J ^n—li ) • • • ) '^ni til ■ ■ ■ 1 tK—2n) 

(12a) 


. . . . . ,tj^_2n) 

(12b) 

K,... 

5 ^n—1 ? -*■? ? • • • ? ? • • • ? ^K—2n) 

(12c) 


...n+l 1 1 1 ^n-l-1 'i 

(12d) 


2n n n -i f2n f2n \ 

• 1 ) “1) ■ ■ ■ ) “n—1) -*-Ul ) ■ ■ ■ ) ^K—2n) 

(12e) 


Of these, we require that 1 , u\, ..., • • •, u”,..., u^_i he distinct. Hence, we will require that — n + 1 < N. 

Furthermore, we want these along with the u’s, i.e., 1 , uj,..., u^_i, ..., u'^, ■ ■ ■, u^-i, vl, - ■ ■ ,Vn, ..., vf ^,..., 
to include all of 1 , 2 ,... , A^. Hence, we need n to he such that 

ri^ — n + l<N< 3 n^ — n + 1 . ( 13 ) 


We can verify that the choice of n in (l 5 ]l, which is reproduced helow, satisfies this. 

1 + + 12(A^ — 1) 

6 ■ 


Consider the first request vector and the first n users. User 1 requests file Wi, and the rest n — 1 users request 
files (w^i ,..., Similarly, in the second request vector, user 2 requests file Wi and the rest n — 1 users 

request files ..., In the same manner for the n-th request vector, user n requests file Wi and the 

first n — 1 users request files (VFnj,..., These (VFi, VF^i,..., Wu^_J are — n + 1 distinct files in the 

database. For the second set of n request vectors, users n + 1 to 2 n request the same files as users 1 to n in the 
first n request vectors. For the first n request vectors, users n + 1 to 2 n requests files (FFyi,..., fCy")- For fhe 

second n request vectors, users 1 to n requests files ..., By our choices we have ensured thaf 

these 2 n^ files contain the remaining N — {v? — n + 1 ) distinct files. 

We now follow the same procedure as in Example 1 . First file Wi can he decoded from all the 2 n request vectors. 
This is done hy considering the first request vector and cache Zi, the second request vector and cache Z2 and so 
on for the remaining request vectors. Then, the first set of n vectors and the second set of n vectors are separately 
combined to decode files ..., J. From the first n request vectors and caches [Zi,..., Z^) the files 

(VF„i,..., VFu"_ J can be decoded. Similarly, from the second set of n vectors and [Zn+i, ■ ■ ■, ^2n) the same set 
of files can be decoded. The rest N — {v? — n + 1 ) files which are included in [W^i ,..., lUuj) can be decoded by 
considering all the 2 n request vectors together using all the caches {Zi,, Z2n)- All R, M, entropies and mutual 
informations are in units of F bits and, as before, we suppress small terms from Fano’s inequality. So for any 
achievable memory-rate pair (M, R) and K > 2 n, 


2 n{M + R) >H ^1) + • • • + 


H X 


Zn ) + 


(i) 

> H 


"(< V,<-l4,if v4K-2n)’ 






\ Zi, . . . , Zn 


VFi + 








H 

^ -^n+l; • ■ • ; 

VEi j 

+ 

2 n 




\ 

IVg 


W 


\ ■Z’l, . . . , Z2n 

/ 


+ 2n + 2n(n — 1) 

> 2n^ + (_N — (n^ — n + 1)) , 


where (i) is similar to steps (a) and (b) together in Example 1 . In step (ii), which is similar to step (c) in Example 1 . 
We define 






Step (iii) is similar to step (d) of Example 1 . Therefore, for K > 2 n, 


M + R > n + 


N — [n? — n + 1 ) 
2n 


( 14 ) 


Notice that 7 = 0 for iT > 2n, and the definition of n is such that N < 3n^ — n + 1. Thus we have proved the 
theorem for a = 1, iT > 2 n. 

When K < 2 n, we defined 7 > 0 as the smallest integer such that K > 2(n — 7 ). Notice that since K > 2 , 
(n — 7 ) > 0. Recall that we had considered 2n vectors. Now we consider 2(n — 7 ) request vectors. We follow the 
same steps as above with n replaced by n — 7 . Eor this, we will now need N to satisfy (cf. (fTSll l 


(n — 7)^ — (n — 7) + 1 <A^< 3 (n — 7)^ — (n — 7) + 1 . 


It is easy to verify that the left inequality follows from the definitions of n and 7. Hence, for N < 3 (n—7)^ —(n—7)+!, 


M + R > {n 


. I iV-((n-7)^-(n-7) + l) 
2 (n- 7 ) 


Eor K < 2 n and N > 3(n — 7)^ — (n — 7) + 1 , we proceed as before, but now the number of files N is larger than 
the number of indices u’s, u’s, and 1 . We may set them all to be distinct files and hence, in sfep (iii), insfead of 
decoding N — {{n — 7)^ — (n — 7) + 1 ) files, we now have (3(n — 7)^ — (n — 7) + 1 ) — ((n — 7)^ — (n — 7) + 1 ) 
files. Thus, 


M + R> {n 



1 ? 


= 2 (n- 7 ). 


(n-7) + 1) - ((n-7)^ 
2(n - 7) 


(n - 7) + 1) 


This completes the proof for a - 
Eor the case of it' > 2 n (i.e., 7 
analogous to (fTSl) is now 


1 . Eor generalizing this to any a > 0 , we first consider the case of 
: 0 ), we consider [sets of 2 n request vectors similar to (fT^ . The condition 


1 

a 


(n^ — n + 1) < N < 


1 

a 


(3re^ — n + 1 ) , 


( 15 ) 


which can be verified to hold for n as defined in ([ 5 ]) with 7 = 0. Now, in step (i), files can be decoded by 
decoding one file from each of sefs of 2 n requesf vectors. Then, in step (ii), we may now consider 2 sets 
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of n vectors each such that n(n — 1 ) files can he decoded from each set. The remaining — n + l) can 

he decoded hy combining all the \ 2 n vectors. Hence for K > 2 n, 


2 n \ M + 


R 1 > 

> 2 


( 2 n) + 


n^ + N - 


(2n(n — 1 )) + N — 
(n^ — n + 1 ). 


(n^ — n + 1 ) 


Since aM > we have, for K > 2 n, 


r^i 


aM + R> n + 


^ {n'^ -n + l) 


2 m 


( 16 ) 


n 


The proof for K < 2 n is along the same lines as for a = 1; as above, we now work with 2(n — 7 ) request 
vectors instead of [ 2 n. 


When -/V < [7 
between these \ - 

\ a 


we consider [request vectors such that one of the users, say the first user, requests all N files 
request vectors. From this we get, M +\^~\ R> N which gives aM + R 

-tIt- This completes 

the proof for a > 0. 

Now we prove the second part of the Theorem 1 when a > 1. We first consider the case of iT > 2 \_a\ n, where 
n is as defined in ®. Note that 7 of (fTOl) is 0 in this case. Now consider the following 2n request vectors. 


^ 1 , . . . , [aj ,Ui, . . . , — ) • • • > ) ^K— 2 n[a\^ 

(2 2ii|2 2 2 2 j.2j.2 \ 

. . . , , 1 , . . . , [aJ , ■ ■ ■ , «nH-H > • • • > ^nH > > ^K- 2 nyc.\) 


( 17 a) 

( 17 b) 


• • • ! L“J >'^1) • • • ■ ' ' ^^K-2n[ai) 


( 17 c) 

( 17 d) 



.,2n 

^n[a\ 




n 

nla\ — \_a\ 


j.2n 




2n 

K—2n [qJ 


( 17 e) 


Of these, we require that 1 ,..., [aJ ,uj,...,, ■ ■ ■, Ui, ■ ■ ■ ^'u^yal-ya] distinct. Hence, we will require 
that [aJ (n^ — n + 1) < N. Furthermore, we want these along with the u’s, i.e., 1 ,..., [aJ ,u\,..., j • • •, 

..., • • • > '^nLaJ > • • • ’ ^1"’ • • • > include all of 1 , 2 ,... , iV. Hence, we need n to be such that 

[aJ {v? — n + 1) < iV < [aJ ( 3 n^ — n + 1 ). 


We can verify that the choice of n in ®, which is reproduced below, satisfies this. 

L«J + VL«jmi2LaJ (iV-H) 

n = - - -;—;- . 

6 [aJ 


Consider the first request vector and the first n [aJ users. Users 1 to [aJ request files Wi fo VF|^qJ and the rest 
n \ a\ — [aJ users request files (w^y , • • •, ^ ^ ^ ^ ^ Similarly in the second request vector, users [aJ +1 to 2 [aJ 

request files VFi fo VFj^aJ nnd the rest n [aJ — [aJ users request files (Wu2, ..., ^ ^ ^ This proceeds in the 

same manner until the n-th request vector. These ^VFi,..., VF|^aj, W^y , ■ ■ ■, — n+ 1 ) distinct 

files in the database. For the second set of n request vectors, users n [aJ +1 to 2 n [aJ request the same files as users 
1 to n [aJ in the first n request vectors. For the first n request vectors, users n [aJ + 1 to 2n [aJ requests n^ \ a\ 
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files (Wyi ,..., For the second n request vectors, users 1 to n requests [aj files • • •, 

By our choices we have ensured that these 2 n^ [aJ files contain the remaining N — [aJ {v? — n + 1 ) distinct files. 

We now follow the similar procedure as in the case when 0 < a < 1 . First files Wi fo W\a\ can he decoded from all 
the 2 n request vectors. This is done hy considering the first request vector and caches Zi to the second request 
vector and caches to .^2 [qJ and so on for the remaining request vectors. Then, the first set of n vectors and 

the second set of n vectors are separately combined to decode files (w^i,..., From the first n request 

vectors and caches (Zi,..., the files (Wu\-, ■ ■ ■, can he decoded. Similarly, from the second set 

of n vectors and (Z^i^^iJ+i) • • •) ^2n[aj) the same set of files can he decoded. The rest N — [aJ {v? — n + 1 ) files 
which are included in (Wyi ,..., W^2ri ^ ^ can he decoded hy considering all the 2n request vectors together using 
all the caches (Zi,..., Z2„[aj)- R, M, entropies and mutual informations are in units of F hits. So for any 
achievable memory-rate pair (M, R) and K > 2 n [aJ, 

2n ([aJ M + R) >2n [aJ + 2n{n [aJ — [aJ ) + N — [aJ (n^ — n + 1) 

> 2 \ oi\ + N — [aJ (n^ — n + 1 ). 


Since a> [aJ, for K > 2 n [aJ, 


aM + R> n [aJ + 


N — [aJ (n^ — n + 1 ) 
2n 


The proof for K < 2 n [aJ is similar to the case of 0 < a < 1 . Here we find the least integer 7 such that 
K > 2 (n — 7) [aJ. Notice that since K >2 [a\, (n — 7) > 0 . Now we consider 2 (n — 7) request vectors instead of 
2 n. For this, we will now need N to satisfy 

[aJ ((n - 7)^ - (n - 7) + 1) < N < [aJ ( 3 (n - 7)^ - (n - 7) + l) . 


It is easy to verify that the left inequality follows from the definitions of n and 7. Hence, for 

N < [aJ ( 3 (n - 7)^ - (n - 7) + l). 


aM + R> {n 


7) [aJ + 


N — [aJ ((n — 7)^ — (n 
2 {n - 7) 


7) + 1 ) 


For K < 2 n [aJ and N > [aJ (3(n — 7)^ — (n — 7) + 1 ), we proceed as before, but now the number of files N 
is larger than the number of indices u’s, u’s, and 1 , ..., [aJ. We may set them all to be distinct files and hence, in 
step (iii), instead of decoding N — [aJ ((n — 7)^ — (n — 7) + 1 ) files, we now have [aJ (3(n — 7)^ — (n — 7) + 1) — 
[aJ ((n — 7)^ — (n — 7) + 1 ) files. Thus, 


aM + i? >(n — 7) [aJ + 
=2(n - 7) [a\ . 


[gj ( 3 (n - 7)^ - (n - 7) + 1 ) - [gj ((n - 7)^ - (n - 7) + 1 ) 

2(n - 7) 


This complefes the proof for K < 2 n [gj. 

When N < [gJ we consider [gJ caches such that among them all K users are included. We consider one request 
vector where among the users all the N files are requested. Since N < [gJ from the [gJ caches all the files can be 
decoded, we gel aM + R> N. This completes the proof of Theorem 1 when g > 1 . 


Proof of Lemma 1 . 


Proof of Lemmas 


Using equation (O, by substituting and s = K, 


M 

j^+R*{M) > K 

\ a \ 

aM + R*{M) > K 
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which gives, 


Proof of Lemma 2. 


N{a,K) < 


K‘^. 


□ 


This proof follows from Theorem 1. Consider the case when K is even and a > 0. We want to show that for 


N = 


K 


4 2 ^ ^ ’ 


(18) 


the lower hound of Theorem 1 gives aM + R > K. To see this, substitute N from (ITSl) in ([5])-® to see that n = ^ 
and 7 = 0. Then, the lower hound of ([U) indeed gives aM + R> 2n = K. Hence we have for even K, 

1 


N{a,K) < 


a 


K ^ 


To handle odd K as well, we note that N{a,K) is a non-decreasing function of K for fixed a. Hence for a > 0 
and K > 2, 


N{a,K) < 




K' 

~2 


+ 1 


Following the same procedure for a > 1 we first consider K to he such that K = 2n [aj. We choose N to he, 

3^2 K 


N = 


4 LaJ “ T + L“J 


Then, the lower hound of ® gives aM + R > 2n [aJ = K. To find for any K, we note that N{a,K) is a 
non-decreasing function of K for fixed a. Hence for a > 1 and K >2 [aJ, 

/V 

+ 1 • 


N{a,K) < [aJ 

Summarizing for iF > 2 users and a > 0, 

N{a,K) < 

For K >2 [aJ users and a > 1, 

N{a,K) < [aJ ^3 

Proof of Lemma 3. 


K 


2L«J 


2L«J 




K' 

~2 


+ 1 


K 


n 2 


K 


+ 1 • 


□ 


To find fhe minimum number of files such fhaf {aM + Rc{M)) is K for fhe coded caching sfrafegy explained 
in section IITI-RI notice that. 


aM + Rc{M) =aM + 


r (1 - 


1 + 


KM 


=aM + 


N 

KN - KM 
KM + N ■ 


Since M takes only those values for which G{l,2,...iT}as defined by fhe coded caching sfrafegy we subsfifufe 


MK 

N 


= 1. Solving fhis we obfain. 


.r 1 K K 

iV = --^- 

a \ 2 2 


(19) 
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To show that for all N less than (fT9l ). the scheme satisfies aM + Rc{M) < K, consider N 
M = ^ and substitute in aM + Rc{M). We get, 


aM + Rc{M) 


= (J.1V1 -f 


KM + N 
_aN ^ K{1 - 1/K) 

^ 2 

aiK'^ + K) K-l 


2aK 
K + 1 K - 1 
^ 


<K. 




Hence, 




□ 










